Signal acquisition via polarization modulation in single photon sources 
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A simple model system is introduced for demonstrating how a single photon source might be used 
to transduce classical analog information. The theoretical scheme results in measurements of analog 
source samples that are (i) quantized in the sense of analog-to-digital conversion and (ii) corrupted 
by random noise that is solely due to the quantum uncertainty in detecting the polarization state 
of each photon. This noise is unavoidable if more than one bit per sample is to be transmitted, and 

Q\ , we show how it may be exploited in a manner inspired by suprathreshold stochastic resonance. The 

system is analyzed information theoretically, as it can be modeled as a noisy optical communication 
channel, although unlike classical Poisson channels, the detector's photon statistics are binomial. 

CNj , Previous results on binomial channels are adapted to demonstrate numerically that the classical 

■ information capacity, and thus the accuracy of the transduction, increases logarithmically with the 
square root of the number of photons, N. Although the capacity is shown to be reduced when an 
additional detector nonideality is present, the logarithmic increase with TV remains. 

■ Extensive efforts to establish practical quantum computers incorporating quantum communication and/or cryptog- 
\ raphy have led towards the development of reliable Single Photon Sources (sps) 0,13. While much research in this 

i - 1 ■ area is targeted towards quantum information processing, and the concept of qubits 3| > the impending availability of 
*£h a reliable SPS may also have broader application. 

Here we introduce and analyze a paradigm of quantum photonic information processing that falls beyond the scope 
of classical Poisson light 0, d, H, 0], sub-Poisson squeezed light Q and the usual focus of quantum information 
theory [3|. We consider a model for the transduction of classical information via a quantum optical channel and a 

■ classical detector. 

In our model, an information source directly modulates a SPS that emits photons with equal energy at a known 
~ 1 constant rate. Unlike the Poisson or sub-Poisson statistics of a classical optical source, this results in deterministic 
\ photon release counts. Significant recent experimental progress @, Q indicates that an SPS may soon offer the possi- 
. bility of modulation of the information source onto quantum properties of individual photons, namely the polarization 
angle. Thus the focus of this paper is on quantifying the potential benefits from exploiting such a controllable SPS 
\£> polarization angle in sensing applications. 

^^O We model the information source to be sensed (the 'signal') using the standard information theoretic approach, 

i.e. we treat the signal as a discrete random variable. While our system results in a quantum information channel, it is 
t— I ■ capable of transmitting classical information, as considered, e.g., in [i^lTll]. This is similar to the scenario in (lO. Il2j 
t-H I where two photons are considered in the context of the Holevo bound Q . However it differs in that we specify that an 
arbitrary (but constant) number of photons, N, are used for a single measurement of the signal, where each photon is 
produced independently and unentangled from a SPS. Thus the N photons produced can be considered as a product 
input state [131 ] . 

The second part of the model signal transduction system is a classical photon detector. We assume several standard 
practical constraints, i.e. that the 'receiver' consists of a classical horizontalpolarizer and detector capable only of 
counting the number of photons received in a fixed duration, t s , as in 0, Q. We further assume that the source 
sample period is also equal to t s , so that the source and receiver are synchronized in time. We quantify performance 
information theoretically in bits per source sample or bits per photon. 

While practical photon counters are limited by inefficiency in the conversion of photons to electrons, and the photon 
count is a random variable for a given number that are transmitted [l4l |. we consider only an ideal photon counter. We 
make this assumption because we aim to find the upper limit to the sensing accuracy of the scheme. Other important 
losses in a practical system will lead to lesser performance; our results can be interpreted as a quantification of the 
'best case scenario,' which is the typical utility of information theory. 

We first consider a relatively trivial scheme. Two ways of achieving one bit per source sample in time window t s 
include either transmitting or not transmitting a photon - in which case the polarizer is unnecessary - or to transmit 
either horizontally or vertically polarized photons. This may be viewed as a limiting case of classical on-off keyed 
(OOK) modulation. Clearly, this approach would be limited by the controllability of the SPS, and the readout /reset 
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speeds of the detector. It may not be feasible to readout the photon count at the same rate as photons are produced, 
or it may only be feasible to produce N > 1 equally polarized and unentangled photons during time t s . If only 
horizontally or vertically polarized photons are used, this would be inefficient and redundant, and reduce the rate to 
1/N bits per photon (although remaining equal to tj 1 bits per second.) 

We now consider whether the number of bits per photon can be improved upon if the polarization angle during 
each sample period, t s , can be set to any desired angle between vertical and horizontal. In this case, detection or 
non-detection of transmitted photons is no longer deterministic, and has a probability that depends on the original 
polarization angle. However the actual photon count can be any of the integers in the range 0,..,iV, a fact that can 

be exploited to obtain O ( log j ^ jv - ) ^ rather than l/N bits per photon. 

We assume that in a duration t s the SPS produces N unentangled photons all perfectly polarized to some real-valued 
angle to horizontal, 9. Let the state of a single photon polarized to angle 9 be the qubit 

\ij))=cx3s{d)\H)+sm{e)\V), (1) 

where \H) represents horizontally polarized and \V) represents vertically polarized. 

Let the polarization angle of the detector relative to horizontal be the independent continuous random variable 
$, with support 5$ = [0, a], a G [0, When each photon passes through the polarizer, the probability that it is 
detected, conditioned on 9, and $ = </> is 

p d (0\4>) = cos 2 (0-<£), fle[o,J]. (2) 

Let the density of $ be /$(</>). We define F c {a) — J Q a /$(</>) cos (2<p)d<f> and F s (a) = /$(</>) sin (2<f>)cUf>. From 
Eq. ([2]), the probability of detecting any individual photon with polarization angle 9 is given by 

P d (6,a) = I h{ct>)P d {d\4>)d4> 

= 0.5 (1 + cos (20)F c (a) + sin (20)F s (a)) . (3) 



Let y be the random variable representing the count of photons that propagate through the horizontal polarizer 
after TV are transmitted with polarization angle 9, and let the distribution of source samples have a cumulative 
distribution function (CDF), Fg(6). For N > 1, the channel becomes a binomial channel [15j, since as the photons 
are unentangled, the probability that n are counted at the detector is given by the binomial distribution, 

P y \oto = n\e,a) = (%)P d (6 ) a)»(l-P i (6,a))W-»\ (4) 

where n = 0, .., N. The mean number of photons counted for angle 9 will be N — NP d (9, a). 

The information theoretic limits of the performance of this channel can be found by maximizing the mutual infor- 
mation, I(9;y), between the random variables 9 and y [lfj. The result is the channel capacity, 

C := max I{9;y). (5) 

The units of C and 1(9; y) are bits per source sample, where here each sample results in the emission of N photons 
all polarized to the same angle 9. Converting to bits per photon means dividing C or / by N, and to bits per second 
means dividing by t s . Note the channel of Eq. Q with fixed N is a different kind of photonic 'binomial channel' 
to that considered in [6|, where the number of photons produced at a source is modeled as a controlled discrete 
random variable, N , with different values representing different source symbols, while photon detection probabilities 
are constant. In contrast, here N is a constant and the photon detection probability is modulated by the angle 6. 

Many channel capacity problems include constraints on the source distribution [lq |. For example, [7| considers 
maximum amplitude and average power constraints for standard Poisson light. Here we have both constraints implic- 
itly, via the maximum number of photons per sample, N . Furthermore, we now show that the mutual information 
between 9 and y is equal to that between the variable t := P d (9,a) and y, and that allowing for uncertainty in the 
receiver (i.e. a > 0) is equivalent to placing additional amplitude constraints on t. 

It is straightforward to show that t = P d (9, a) has a single maximum for 9 € [0, 7r/2], 

9°(a) :=0.5arctanf5^Y (6) 
\Fc(a)J 

This attains maximum probability of detection 

t max (a) := Pd{e°, a) = 0.5(1 + ^ F c {af + F s {af) < 1. (7) 
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We also have at the extremes of horizontally or vertically polarized photons Pd(Q, a) = 0.5(1 + F c (a)) and Pd(ir/2, a) = 
0.5(1 — F c (a)), and the minimum probability of detection is i m i n (a) := niin{Prf(0, a), a)} > 0. 

The convexity of t with respect to 9 also implies that for a > some polarization angles induce the same conditional 
output distribution — that is, they produce ambiguous outcomes at the receiver. However, it also means that on the 
interval t £ [iminjimax] there exists a one to one continuous and invertible deterministic mapping t 6. That is, £ is 
a monotonic function of 9. Thus, we must have 1(9; y) — I(t; y), and the channel is equivalent to one with transition 
probabilities 

P ylt (y = n\t) = (?) t n (l - rf N - n \ t e [i mi „(a), Wa)]. (8) 

If polarization angles on the full interval [0,7r/2] are used, then any pair of angles that produce the same t are 
interchangeable, and can be treated as the same input sample. 

The capacity of the binomial channel of Eq. ([5]) (for a — 0, i.e. no constraints on t) has been considered in several 
contexts, [l5|, [Tt], [H, El > and it is known that in the large N limit 

/ iV7r\ 

Jim C = 0.51og 2 — :=Cx>. (9) 



n^oo \2e ) 

This is a lower bound to the capacity of the binomial channel of Eq. ([8]) when N is finite [15| . Straightforward 
adaptation of analysis in (l8l | shows that this asymptotic capacity, Coo , is achieved when 9 is a continuous random 
variable uniformly distributed between and tt/2. 

We now address the problem of finding channel capacity for finite N and a > 0. Since we have a discrete memoryless 
channel, with an output consisting of N+l states, capacity is achieved when only a finite number of angles, M < N+l, 
are used (lrl Corollary 3, pp. 96]. This means we can label the capacity achieving input distribution for the channel 
of Eq. (|8|) as a probability mass function, with real valued support given by {ti, .., t m , .., ijvf}j and mass values 
{Pg(l), .., Pg(m), .., Pq(M )}. Since the transition probabilities are dependent on the support points, the capacity 
problem can be written as 

C= max I(t;y), (10) 

{M,P e (l),..,P e (m),ii,..,t M } 

where M > 1, X)m=i ^M m ) = 1 and t m G [0, 1]. Although mutual information is convex in the probabilities, Pg(m), it 
is not convex in t and solving using standard gradient descent methods does not guarantee a global optima. However, 
several numerical approaches to solving such capacity problems exist. These all rely on the following property. Define 

i(tm) ■= Y.n= P(n\t = t m )log 2 W here P y (n) = £* f =1 P e (m)P y]t=t Jn\t = t m ). By [E Theorem 4.5.1], 

a necessary and sufficient condition for achieving channel capacity for any given set of fixed t m of size M is that 

f =C Vi m s.t. P e (m)>0 

Consequently capacity is achieved if and only if all maxima of the function i(t) are achieved by a support point 
t = t m and all support points achieve a maxima. This fact allows convex optimization techniques to be used, thus 
guaranteeing global convergence [l5|, [l9| . We used CVX, a package for specifying and solving convex programs ■ 

The result of such a numerical optimization for this channel is shown with a solid line in Fig. [U for the case 
where the detector's angle is deterministic (a = 0). The results are in agreement with [l^, The capacity as 
a function of a, for various values of N is shown in Fig. [2] for uniformly distributed detector angles. In this case 
imax(a) = 0.5 (1 + sin (a) /a) and t m i n (a) = 0.5 (1 — sin (2a) /2a). While clearly decreasing as a increases (which can 
be thought of as increasing detector noise) the capacity still increases with increasing N, and is larger than one bit 
when N = 63 even at a = 0.5-7T. 

Fig. Q] also shows two known lower bounds to the capacity. The first, Coo (given by Eq. ([9])) significantly under- 
estimates the true capacity. The second, given in closed form in Eq. (13) in [191 ] and denoted here as II, is much 
tighter. In the photonic binomial channel context, it is equivalent to the mutual information that results by ensuring 
the source polarization angle has a mixture distribution that is absolutely continuous and uniformly distributed on 
(0, -|), and has discrete mass points at angles and ^, each with some probability Q < 0.5. This means that while 
the capacity is achieved by a purely discrete distribution, nearly the same performance can be achieved using analog 
modulation over all polarization angles. 

Further verification can be obtained as follows. If an arbitrary output distribution P y (n) is chosen, the global 
maximum of i(t) is an upper bound to capacity [2lj ] . Finding a close upper bound requires a judicial choice of P y (n). 
As an example, let 

f TT97§ n\ n = 1 N 

r>U ( \ J , , l+2(-R-Q) , 10 . 

{ ff (i+2(fl-q)) n=l,..,JV-l. 



4 



where B{-, •) is a beta function, Q = ^^,t"' 5 A , and R = J Following [2l| we write 



v^FT(JV+l) ' 

N 

I I'lll I \ \ 

(13) 



... I ' P{n\t) 
I{x; y) < Iu ■= max > P(n\t) log 2 

4 WW 



where P(n|i) is given by Eq. ([8]). The upper bound Ijj can be easily calculated numerically and is shown in Fig. [T] 
and as can be seen, it verifies the numerical calculation of capacity. 

Our results mean that the achievable mutual information increases with N as 0(0.5 log 2 (N)) — c.f. Eq. ©. However 
since the number of bits per photon decreases to zero as N increases, exploiting single photon polarization is of 
potential benefit in terms of bits per second only when the source sample does not change before more than one 
identically polarized photon is emitted. This fact may also be exploited in a hypothetical multi-photon source that 
releases N identically polarized photons simultaneously. In this case, capacity is increased in an analogous way to 
utilizing multilevel pulse amplitude modulation (PAM) in conventional communication systems. The difference is 
that the choice of the number of PAM amplitudes depends on channel noise, whereas for the SPS it depends on N, 
and indirectly on the quantum uncertainty in polarized photon detection. 

Indeed, the model described here can be thought of as benefitting from randomness, in a similar sense to suprathresh- 
old stochastic resonance (SSR) 0, [H. The model of 0, S3 has the same channel as Eq. (01, except Pd{9 1 a) is 
replaced by the CDF of additive noise. Mathematical equivalence (for a = 0) follows, since photon detection is a 
random process equivalent to the corruption of 9 by additive random noise, ij S [— ? , f ] , with probability density 
function f v (ri) = cos (2ry), followed by binary threshold detection such that the output is \H) when 9 + rj < j. How- 
ever, unlike |18l . [22j], here SSR cannot be observed since the randomness is not due to noise, and the variance of ?y 
cannot be changed (SSR might instead be observed if sub-optimal input signals were used). Nevertheless, it is only 
the uncertainty in the polarization state of each photon that allows an N state quantization (digitization) of each 
sample, rather than a binary outcome. While a noisy signal results, like SSR this is offset by gaining more than one 
bit per source sample. 

A significant outcome of our analysis is that we have shown that continuously distributed polarization angles 
can achieve performance close to capacity. Thus, the system may be highly suitable for the transduction of analog 
information sources. Indeed, if each photon is thought of as a "noisy sensor," our hypothetical system bears similarities 
to several macroscopic frameworks studied in the signal processing literature — i.e. the "refining sensor network" of 23], 
or the "stochastic pooling network" of [191 ]. 
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Number of photons per sample, N 



FIG. 1: Channel capacity, C, for a perfect detector (a = 0), as a function of increasing number of photons per source sample, 
N (N is defined only for integer values, but a solid line is used as an aid to visibility). Also shown is Coo, i.e. the lower bound 
to the channel capacity given by Eq. (|9}, which asymptotically approaches capacity as N — + oo, a better lower bound to the 
channel capacity, II (described in the text), and an upper bound, Ixj- 
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Maximum detector angle, a 



FIG. 2: Channel capacity for various N as a function of the maximum angle, a, of a randomly varying (uniformly distributed 
tp) angle in the detector's polarizer (i.e. increasing detector noise). 



